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DETAILED ACTION 

1 . This Office Action is in response to communication filed 03/03/2006. Claims 1 , and 3-1 5 are 
pending. Claim 2 is canceled. 

Response to Arguments 

2. Applicant's arguments with respect to claims 1 , and 3-15 have been considered but are moot in 
view of the new ground(s) of rejection. 



Claim Rejections - 35 (JSC § 103 

3. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all obviousness 

rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

4. Claims 1 and 3-15 are rejected under 35 U.S.C. 103(a) as being unpatentable over Ladd et al. 
(6,269,336 filed 10/2/1998) in view of Chung et al (6,1 15,686). 

The table below summarizes the limitations of these claims and teachings in Ladd et al. that meet these 
limitations. 



Claim 
# 


Limitations 


3^ ^l^.iS-JI?!^ 


1 


A document processing apparatus comprising: 

document obtaining means for obtaining a 
document written in a predetermined markuD 
lanauaae from a desianated source 
rule identification information extraction means .. 
the obtaining means 


The network access aDDaratus of the 
system allows the user to access (i.e., view 
and/or hear) the information retrieved from 
the information source. (Col. 3, lines 40- 
42). The information can be stored in a 
database of the information source and 
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can include text content, markup lanauaae 






document or pages (Col 11, lines 42-45). 






The nested HTML tags 




rule selectina means for selectina a rule defining 


The parser unit receives the information 




voice inDut/outout contents from a plurality of 


from the network fetcher unit and parses 




predetermined rules 


the information according to the syntax 






rules of the markup lanauaae. (Column 12, 






lines 18-20) The markup language can 






include elements that describe the 






structure of a document or paae, provide 






pronunciation of words and phrases, and 






place markers in the text to control 






interactive voice services. The markup 






language also provides elements that 






control phrasing, emphasis, pitch, speaking 






rate, and other characteristics. (Column 






16, 12-18 and FIG. 6) As seen from FIG. 6, 






the <DIALOGUE> section contains both 






input candidates and output contents, 






which may also include instructions to fetch 






additional elements via SQL calls. (Col. 41, 






lines 45-50) 




document analyzing means for analyzing a 


The interpreter unit determines the next 




designated range of the document obtained by 


state or step based upon the structure of 




said document obtaining means based on the rule 


the dialoa and the inputs from the user. 




selected by said rule selecting means to fetch 


When the interpreter unit transitions to a 




voice output contents, voice input candidates, and 


new dialog or page, the address of the new 




desianation information for desianatina a next 


dialoa or paae is then sent to the network 




Drocessina object corresponding to each voice 


fetcher. (Column 13. lines 55-59) 




input candidate f 
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voice outputtina means for voice-outputtina the 


The TTS unit of the VRU server receives 




voice outDut contents fetched bv said document 


textual data or information... The TTS unit 




analyzing means 


processes the textual data and converts 






the data to voice data or information. 






(Column 9, lines 3-10) 




voice recoanizina means for voice-recoanizina 


The ASR unit of the VRU server provides 




the voice input bv the user 


speaker independent automatic speech 






recoanition of speech inputs or 






communications from the user. (Column 






9, lines 27-30) 




controllina means for checkina the result of 


The interpreter unit can transition from 




recoanition bv said voice recoanizina means 


state to state (i.e., step to step) within a 




against the input candidates fetched by said 


tree structure (i.e., a dialog) of a markup 




document analvzina means to control obtainment 


language document or can transition to a 




of a new document bv said document obtainina 


new tree structure within the same dialog 




means or next analvsis bv said document 


or another dialog. The interpreter unit 




analyzing means based on designation 


determines the next state or step based 




information corresponding to the input candidate 


upon the structure of the dialog and the 




matching the recognition result. 


inputs from the user. When the interpreter 






unit transitions to a new dialoa or paae, the 






address of the new dialog or page is then 






sent to the network fetcher. (Column 13, 






lines 52-59) 



It is noted that Ladd teaches the claimed invention but does not explicitly teach a rule selecting 
means for selecting a rule corresponding to the extracted rule identification information ...identification 
information stored in a memory... in the obtained document. However, this feature is well known in the art 
as evidenced by Chung et al who teach in figures 1-5b, the HTML parser 24 outputs the HTS control rules 
to the HTS control parser 22 wherein the HTS control parser can receive four different types of rules 
(intonation, audio data rule, enunciation rule and terminology translation rule). Furthermore, the HTS 
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control rules 1 10-180 embedded in the HTML comment tag wherein the rules (110-180) are designated 
by identifiers, (col. 6, lines 9-61). Therefore, it would have been obvious to one of ordinary skill in the art 
at the time the invention was made to incorporate into Ladd the rule selection means as taught by Chung 
et al because it would provide an system that can accommodate HTML documents with nested HTML 
textual tags and enunciate symbols correctly depending on context. 

As per claim 4, Chung et al teach rule selecting means selects a predetermined rule if said rule 
identification information extraction means cannot ...in the obtained document 9 his HTS parser 22, his 
tag mapping table 41 , his parameter 42 and his audio table 43). 



Ladd et al further teach: 









3 


The document processing apparatus according 
to claim 1 . wherein said rule identification 
information is a predetermined attribute value of a 
predetermined taa. 


markup lanauaae document includes taas 
(Column 16, line 29-31) 


4 


The document processing apparatus according 
to claim 1, wherein said rule selecting means 
selects a predetermined rule if the rule 
identification information is not described in the 
obtained document. 


If a pre-existing grammar is not found at 
block, the voice browser dynamically 
generates the grammar for the user input. 
The voice browser looks up the 
pronunciations for the user in a dictionary. 
(Column 14, lines 29-33) 


5 


The document processing apparatus according 
to claim 1 , wherein said document analyzing 
means fetches as said designation information a 
source from which a next document is obtained. 


When the interpreter unit transitions to a 
new dialog or page, the address of the new 
dialoa or paae is then sent to the network 
fetcher. (Column 13, lines 55-59) The 
network fetcher unit retrieves information, 
including markup language documents, 
audio samples and grammars from the 
information sources. (Column 12, lines 
10-14) 


6 


The document processing apparatus according to 
claim 1 , wherein said document analyzing means 


The network fetcher unit retrieves 
information, including markup lanauaae 
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fetches an analyzed ranae of a next document as 
said designation information. 


documents (Column 12, lines 10-14). 

Since network fetcher can retrieve full 
documents, it can inherently retrieve 
multiple documents specified in the 
analyzed range of a next document. 


7 


The document processing apparatus according 
to claim 1 , wherein said rule selecting means 
selects a rule based on instructions from a user. 


The communication node can also allow 
the user to select a particular speech 
recognition model. (Column 6. lines 25- 
36) or choose models based on 
<PROFILE> tag information (Col. 24, lines 
12-65) 


9 


The document processing apparatus according 
to claim 1 . wherein said Dluralitv of rules includes 
a rule which defines a predetermined attribute 
value of a predetermined taa as voice output 
contents, and contents surrounded bv 
predetermined second taas as input candidates, 
in said document. 


The PROMPT element of the markup 
lanauaae is used to define content (i.e.. 
text or an audio file) that is to be presented 
to the user. (Column 18, line 32-36). 
The INPUT element of the markup 
lanauaae is used to define a valid user 
input within each STEP element. (Column 
18, line 56-58) 


10 


The document processing apparatus according to 
claim 9, wherein in said rule, if said recognition 
result matches an input candidate, contents 
ranging from the contents surrounded by said 
second predetermined tags which correspond to 
the input candidate up to a third predetermined 
taa are defined as next voice output contents, and 
an anchor in the voice output contents is defined 
as a next input candidate. 


See example (Column 16, line 63 - 
Column 17, line 15). The page consists of 
one rule (DIALOG) encompassing 
PROMPT elements that define voice output 
contents and INPUT elements that define 
input candidates. The nature of the markup 
language is such that these elements can 
be arranged in a variety of configurations 
that limit claim 11. 


11 


The document processing apparatus according to 
claim 1, wherein said plurality of rules includes a 
rule which defines contents ranging from the head 
of said document to a predetermined taa as voice 
output contents, and an anchor in the voice output 
contents as an input candidate. 


See example (Column 16, line 63 - 
Column 17, line 15). The page consists of 
one rule (DIALOG) encompassing 
PROMPT elements that define voice output 
contents and INPUT elements that define 
input candidates. The nature of the markup 
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language is such that these elements can 






be arranged in a variety of configurations 






that limit claim 1 1 . 


12 


The document processing apparatus according to 


The telecommunication network is 




claim 1, wherein said voice input and voice output 


preferably connected to the communication 




are performed throuah a telephone line. 


node via a high-speed data link, such as, a 






T1 telephone line. (Column 5. lines 39-42) 


13 


A document processing method comprising: 






a document obtaining step of obtainina a 


The network access apparatus of the 




document written in a predetermined markup 


svstem allows the user to access (i.e.. view 




lot lyUaLjt? liUIIl a UColljl IdlCU OUUIUC IIUIII WIIIOII 


sinri/nr h^ar^ thp infnrmatinn rAtrip\/pH from 

GlIU/UI 1 ICQI f LI IC II 1 IfJI 1 1 IOIII/I 1 ICU IvVCU II Will 




the document is to be obtained 


the information source. (Col. 3, lines 40- 






42). The information can be stored in a 






database of the information source and 






can include text content, markup lanauaae 






document or pages (Col 11, lines 42-45) 




a IUIC oclClrMI l(J olC|J UI oclCOlll 11^ a IUIC UCllllilllJ 


i lie pel loci unit i cucivco 11 ic ii uui 1 1 iciuui i 




voice input/output contents from a plurality of 


from the network fetcher unit and parses 




predetermined rules 


the information according to the svntax 






rules.of the markup language. (Column 12, 






lines 18-20) See definition of markup 






language at Column 16, 12-18. 




a document analvzina steD of analvzina a 


The interpreter unit carries out a dialog with 




desianated ranae of the document obtained in 


the user based upon the tree structure 




said document obtaining step based on the rule 


representing a markup lanauaae 




selected in said rule selectina step to fetch voice 


document. (Column 13, lines 45-47) 




output contents, voice input candidates, and 


When the interpreter unit transitions to a 




designation information for designating a next 


new dialog or page, the address of the new 




processing object corresponding to each voice 


dialoa or paae is then sent to the network 




input candidate 


fetcher. (Column 13. lines 55-59) 
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a voice outputtina step of voice-outDuttina the 
voice output contents fetched in said document 
analyzing step 

a voice recoanizina step of voice-recoanizina the 
voice input from the user 

and a controlling step of checking the result of 
recognition by said voice recognizing step against 
the input candidates fetched in said document 
analvzina steD to control obtainment of a new 
document bv said document obtainina step or 
next analvsis bv said document analvzina step 
based on designation information corresponding 
to the input candidate matchina the recoanition 
result. 


The TTS unit of the VRU server receives 
textual data or information... The TTS unit 
processes the textual data and converts 
the data to voice data or information. 
(Column 9, lines 3-10) 

The ASR unit of the VRU server provides 
speaker independent automatic speech 
recoanition of speech inputs or 
communications from the user. (Column 
9, lines 27-30) 

The interpreter unit can transition from 
state to state (i.e., step to step) within a 
tree structure (i.e., a dialog) of a markup 
language document or can transition to a 
new tree structure within the same dialog 
or another dialog. The interpreter unit 
determines the next state or step based 
upon the structure of the dialoa and the 
inputs from the user. When the interpreter 
unit transitions to a new dialoa or paae. the 
address of the new dialog or page is then 
sent to the network fetcher. (Column 13, 
lines 52-59). 















5. Claims 13-15 are the same in scope and content as claim 1 above and therefore are rejected 
under the same rationale. 

The combination of Ladd with Chung does not explicitly teach assigning priorities to rules and 
choosing rules based on their respective priorities. 
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However, the examiner takes the official notice that it is well-known in the art of speech 
recognition to assign priorities to speech models (which are part of the rules specified by the XML 
document in Ladd et al.'s invention) in speech recognition systems in order to make the selection process 
of required speech models more flexible to the user's requirements. 

Therefore, it would have been obvious to one of ordinary skill in the art at the time the invention 
was made to modify the combination of Ladd et al. with Chung to assign priorities and choose rules 
based on assigned priorities because this would enable the system to be more flexible to the user's 
requirements and choose a rule that would best fit the situation. 

6. Applicant's amendment necessitated the new ground(s) of rejection presented in this Office 
action. Accordingly, THIS ACTION IS MADE FINAL. See MPEP § 706.07(a). Applicant is reminded of 
the extension of time policy as set forth in 37 CFR 1 .136(a). 

A shortened statutory period for reply to this final action is set to expire THREE MONTHS from the 
mailing date of this action. In the event a first reply is filed within TWO MONTHS of the mailing date of 
this final action and the advisory action is not mailed until after the end of the THREE-MONTH shortened 
statutory period, then the shortened statutory period will expire on the date the advisory action is mailed, 
and any extension fee pursuant to 37 CFR 1 .136(a) will be calculated from the mailing date of the 
advisory action. In no event, however, will the statutory period for reply expire later than SIX MONTHS 
from the date of this final action. 

7. Any inquiry concerning this communication or earlier communications from the examiner should 
be directed to Patrick N. Edouard whose telephone number is 7033086725. The examiner can normally 
be reached on M-TH 7:30-6:00. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's supervisor, David 
Hudspeth can be reached on 571 272 7843. The fax phone number for the organization where this 
application or proceeding is assigned is 571-273-8300. 
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Information regarding the status of an application may be obtained from the Patent Application 
Information Retrieval (PAIR) system. Status information for published applications may be obtained from 
either Private PAIR or Public PAIR. Status information for unpublished applications is available through 
Private PAIR only. For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic Business Center (EBC) 
at 866-217-9197 (toll-free). 
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